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A  BSTRACT 


This  thesis  develops  an d  explores  the  graphical  analysis 
of  multivariate  data  sets  through  tne  use  of  a  Draftsman 
technique  of  scatter  plot  displays.  These  plot  displays  are 
useful  for  determining  associations  and  relationships 
between  variables  in  order  to  promote  an  understanding  of 
the  characteristics  of  the  data  in  exploratory  and  descrip¬ 
tive  applications.  General  graphical  enhancement  techniques 
such  as  jittering  and  tr ansf ormati ons  are  discussed  and 
incorporated  in  the  development  of  a  computer  program  which 
produces  Draftsman  displays.  A  technical  description  of  the 
Draftsman  computer  program  is  presented,  and  user  implemen¬ 
tation  procedures  discussed.  An  analysis  is  conducted  on 
two  varied  sets  of  data  to  demonstrate  the  versatility  and 
utility  of  the  Draftsman  display  technique  for  exploring 
data  structures. 
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IN  TRODuCTIOH 


A.  MOUVAT  ION 

Recent  advances  in  computer  Hardware  and  sofware  capa¬ 
bilities  have  made  available  to  a  larger  number  of  users 
powerful  diagnostic  and  analytical  tools  for  exploring  data. 
These  same  advances  however  are  responsible  for  a  tremendous 
increase  in  the  amount  of  data  produced  and  available  for 
analysis.  Contrary  to  mathematical  intuition,  the  avail¬ 
ability  of  more  data  availanle  does  not  always  lead  to 
greater  precision  in  subseguent  analysis.  Often  the 
increased  amount  of  data  confounds  tne  analysis  by  overbur¬ 
dening  our  ability  to  process  the  information  in  a  timely 
and  understandable  fashion. 

Graphical  displays  are  a  method  of  visually  portraying 
vast  amounts  of  qualitative  information.  The  primary 
benefit  of  graphical  techniques  is  that  the  human  eye-brain 
system  has  a  powerful  information  processing  capability.  3y 
maximizing  our  visual  capability  to  process  properly 
displayed  data,  we  can  rapidly  summarize  information,  focus 
on  salient  features,  discern  abberations,  and  extract 
details  of  interest  from  a  data  set. 

B.  SCOPE 

The  purpose  of  a  Draftsman  display  is  to  use  the  visual 
impact  of  an  array  of  two  dimensional  scatter  plots  to 
analyze  multivariate  data.  This  can  be  accomplished  by- 
arranging  an  exhaustive  series  of  plots  consisting  of  all 
paired  variables.  This  enables  the  analyst  to  observe  the 
influence  of  each  variable  on  every  other  variable  in  the 
data  set. 


The  concept  of  using  two  dimensional  scatter  riots  to 
display  higher  dimensional  data  structures  is  discussed  m  a 
text  ty  Chambers,  et  al.  [Ref.  1],  The  ideas  from  that 
text  served  as  the  foundation  for  tne  development  of  an 
interactive  computer  program  to  construct  Draftsman 
displays.  Tne  additional  features  of  enhancing  scatter 
plots  such  as  the  jittering  of  discrete  data  values  and 
transforming  variables  can  also  be  applied  to  this  multidi¬ 
mensional  display  procedure.  The  full  considerations  that 
went  into  the  program  development  as  well  as  user  implemen¬ 
tation  procedures  is  amplified  in  later  chapters. 

The  purpose  of  this  thesis  is  to  integrate  the  graphical 
concepts  of  scatter  plots,  jittering,  and  transforming  vari¬ 
ables  into  a  Draftsman  display.  Although  written  in  A 
Programming  Language  (APL)  ,  little  if  any  knowledge  of  this 
language  is  required  to  successfully  utilize  the  program. 

This  thesis  has  been  written  in  three  major  segments  in 
order  to  appeal  to  the  widest  audience  possible.  The  first 
segment,  composed  of  chapters  II  and  III  deals  with  the 
general  concepts  of  graphical  methods  and  user  instructions 
required  to  invoke  the  Draftsman  display  program.  Tne 
second  segment,  comprised  of  chapter  IV  and  Appendix  A,  is 
aimed  at  those  readers  interested  in  tne  technical  details 
and  Craftsman  program  documentation.  The  final  segment, 
found  in  chapters  V  and  VI,  contains  a  stepwise  analysis  of 
two  varied  focms  of  data  to  demonstrate  potential  applica¬ 
tions  of  this  procedure  in  exploratory  data  analysis. 

The  graphs  used  in  this  paper  were  produced  by  an  exper¬ 
imental  APL  package  GRAFSTAT,  whicn  the  Naval  Postgraduate 
School  is  using  under  a  test  agreement  with  tne  IBM  Watson 
Research  Center,  Yorktown  Heights,  New  York.  We  are 
grateful  to  Dc .  P.D.  Welch  and  Dr.  ?.  Heidelberger  for 
making  GRAFSTAT  available  to  us. 


A.  DATA  DISPLAYS 


II.  GRAPHICAL  TECfiMIiUES 


A  variety  of  graphical  methods  sucn  as  cox  plots,  nisto- 
grams,  stem  ana  leaf  displays  and  scatter  plots  are  avail¬ 
able  to  explore  relationships  which  may  exist  between 
variables  of  a  data  set.  The  scatter  plot  is  perhaps  one  of 
the  most  powerful  graphical  methods  for  displaying  bivariate 
data.  The  foremost  feature  of  tne  scatter  plot  is  that  all 
of  the  data  of  interest  is  readily  displayed  for  visual 
interpretation.  In  addition,  the  simplicity  of  construc¬ 
tion,  compactness  of  the  display,  and  adaptability  to  otner 
graphical  ennancement  techniques,  contribute  to  the  power  of 
this  display. 

In  contrast,  numerical  summaries  may  reflect  correlation 
but  tell  little  aoout  clustering,  patterns,  or  other  rela¬ 
tionships  whicn  mignt  be  present.  This  is  particularly  true 
of  larger  data  sets  consisting  of  more  than  twenty  observa¬ 
tions  and  more  than  two  variables.  In  these  larger  data 
sets,  the  sheer  volume  of  data  points  to  be  compared  makes 
interpretations  a  tedious  and  time  consuming  process. 

Figure  2.1  is  a  scatter  plot  of  weight  versus  engine 
displacement  for  106  different  models  of  cars  produced 
during  1S83  [Ref.  2  :pp.  320-356  ].  A  numerical  summary  might 
readily  impart  the  fact  that  an  increase  in  car  weight  is 
associated  with  an  increase  in  engine  size.  The  scatter 
plot  however  rapidly  makes  apparent  some  other  interesting 
features.  We  can  see  that  the  observations  consist  of  two 
distinct  groupings.  For  vehicles  under  3,000  lbs  there  is  a 
strong  positive  linear  dependency  between  weignt  and 
displacement.  For  the  heavier  venicles  over  3,000  Ins, 
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Figure  2.1  Basic  Scatter  Plot. 
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plots  consist  of  engine  displacement  versus  .eight  and 
turning  radius.  This  arrangement  of  tne  plots,  while  some¬ 
what  redundant,  allows  the  viewer  to  scan  across  rows  or 
down  columns  of  plots,  thereby  matcning  up  points  that 
correspond  to  the  same  observations  in  different  plots. 


Figure  2.2  Three  Dimensional  Draftsman  Display. 

Observing  the  bottom  row  of  plots  in  figure  2.2  ,  we  can 
track  three  distinct  groupings  of  points  through  all  the 
paired  plots.  These  three  groups  correspond  to  the  small, 
medium,  and  large  size  categories  of  vehicles,  k  quick  look 
at  the  associations  exhibited  in  this  display  indicates  also 
that  engine  displacement  has  a  tigater  relationship  with 
weight  than  it  does  to  turning  radius.  Other  relationships 
are  also  evident  and  are  presented  in  greater  detail  in  the 


B.  JITTERING  OF  VARIABLES 

In  certain  circ unstanc es,  a  scatterpiot  may  itself  be 
visually.  deceiving  due  to  the  overlapping  of  data  points 
within  the  plot.  This  situation  may  be  particularly  preva¬ 
lent  when  one  or  both  of  the  plotted  variables  have  a 
iiSited  range  of  discrete  values.  In  order  to  alleviate  the 
overlapping  and  enhance  the  actual  relationsnips  that  exist, 
a  small  amount  of  random  noise  may  be  added  to  one  or  botn 
of  the  variables  to  "jitter"  their  horizontal  and  vertical 
locations  witmn  the  plot.  The  amount  of  random  noise  added 
or  subtracted  from  the  original  data  values  must  be  suffi¬ 
cient  to  prevent  overlapping  but  small  enough  so  that  the 
original  data  values  can  be  recovered  by  rounding  to  the 
nearest  whole  number.  Typically  the  random  noise  added  is 
two  to  five  percent  of  the  total  range  of  the  variable 
values.  [Ref.  1  :pp. 106-107] 

The  visual  difference  resulting  from  jittering  can  be 
seen  in  figure  2.3,  where  the  maintenance  records  for  1981 
versus  1982  was  plotted  for  106  automobile  models. 
Maintenance  is  a  category  variable  with  values  of  0,  1,  ..., 
5.  Clearly  a  problem  of  overlapping  exist  in  the  basic 
scatter  plot  seen  on  the  left.  The  jittered  version  on  the 
right  is  a  more  accurate  picture  of  the  distribution  and 
clustering  prevalent  in  the  data. 
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Figure  2.3  Dnjittered  and  Jittered  Plots. 


m 


C.  TRANSFORMATION  OF  VARIABLES 

The  primary  purpose  of  employing  transf ormations  is  to 
linearize  and  simplify  the  ooserved  relationsnip  between  the 
variables  plotted.  In  many  instances  the  plots  may  be 
further  enhanced  through  the  use  of  transf ormations  m  order 
to  achieve  a  simpler  and  more  understandable  picture  suit¬ 
able  for  visual  comparisons  and  exploration.  Hoaglin 
[Ref.  3  :p.  104  ],  proposes  the  following  pertinent  reasons 

for  employing  transformations. 

1.  Facilitate  interpretation  in  a  natural  way. 

2.  Promote  symmetry  in  a  batch. 

3.  Promote  stable  spread  in  several  batches. 

4.  Promote  straightline  relationships  between  the  vari¬ 
ables. 

5.  Simplify  the  structure  of  a  two  way  or  higher  dimen¬ 
sional  data  structure  so  that  a  simple  additive  model 
can  assist  in  the  understanding  of  the  characteris¬ 
tics  of  the  data. 

A  key  factor  of  transforming  the  variables  is  that  if 
the  correct  transformation  is  applied,  the  resulting  scatter 
plot  will  appear  more  linear  in  form.  This  in  turn  visually 
enhances  recognition,  detection  of  deviations  and  outliers, 
and  assist  in  observing  relationsnips  or  patterns. 

As  previously  discussed,  the  basic  scatter  plot  of 
weight  and  engine  displacement  is  divided  into  two  distinct 
groups  of  points  as  seen  in  the  left  plot  of  figure  2.4  . 
While  the  lower  group  appears  fairly  linear,  the  upper  group 
is  more  dispersed  and  curved  in  shape.  The  plot  on  the 
rignt  shows  the  effect  of  applying  a  log  transform  to  the 
engine  displacement  values.  The  resulting  plot  of  the 
transformed  data  becomes  more  linear  over  the  entire  range 
of  engine  displacement  values  (see  figure  2.4  ). 
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Figure  2.4  An  Example  of  Transformation 


A  note  of  caution  is  appropriate  in  determining  when  to 
use  transformations  in  the  Draftsman  display.  Since  trans¬ 
formations  result  in  a  cnange  to  the  displayed  values  and 
scale,  care  must  be  taken  to  avoid  confusion  during  subse¬ 
quent  analysis.  He  should  insure  that  the  benefits  of 
describing  the  data  with  a  transformation  is  greater  than 
the  loss  of  simplicity  incurred  through  its  use. 


III.  DRAFTSMAN  OSER  INSTRUCTIONS 


A.  GENERAL  GUIDANCE 

The  Draftsman  program  was  written  in  APL  and  is  designed 
to  be  used  in  conjunction  with  the  experimental  IBM  graphics 
software  GRAFSTAT.  The  Draftsman  program  is  interactive  and 
reguires  little  knowledge  of  APL  to  use.  The  APL  versed 
user  can  easily  modify  the  basic  program  and  called  subrou¬ 
tines  for  more  specialized  forms  of  analysis. 

The  grapaical  software  which  generates  the  Draftsman 
displays  reguires  the  use  of  either  the  IBM  3277GA  or 
3278/79  graphic  display  terminals  'Ref.  4].  Normally  these 
terminals  are  available  as  public  facilities  with  special 
accounts  and  passwords.  Once  logged  on  to  one  of  these 
terminals  the  user  may  link  back  to  tneir  own  account  and 
copy  any  of  their  own  files  as  desired-  This  is  useful  in 
retrieving  data  files  which  the  user  wishes  to  analyse  with 
a  Draftsman  display.  [Ref.  5] 

B.  USER  REQUIREMENTS 

This  section  will  provide  a  brief  overview  of  the  user 
inputs  reguirad  to  generate  a  Draftsman  display  of  a  data 
set.  An  explicit  step-by-step  description  of  all  input 
requirements  is  found  at  the  conclusion  of  this  chapter  in 
figures  3.2  through  3.6  . 

Since  the  Draftsman  program  is  written  in  APL,  the  user 
will  have  to  enter  the  APL  sub- envir on ment  in  order  to  gain 
access  to  the  graphics  programs.  Once  in  the  APL  environ¬ 
ment,  the  APL  characters  set  is  invoked  by  keying  the  APL  ON 
key.  These  APL  characters  are  found  in  red  and  supercede 
the  normal  keys.  It  is  recommended  that  the  first  time  user 


taice  a  few  minutes  to  familiarize  taemselves  with  the  loca¬ 
tions  of  these  characters. 

The  API  environment  will  allow  the  user  to  copy  and 
retrieve  both  the  GRAFSTAI  and  Draftsman  programs  as  snown 
in  figure  3.2  .  Once  these  programs  are  in  the  workspace 
the  basic  set  up  procedure  is  complete  and  tne  user  is  ready 
to  actually  initiate  the  Draftsman  program  to  produce  a 
display.  The  Draftsman  program  is  initiated  by  typing 
DRAFTSMAN  folLowed  by  return.  The  program  will  respond  with 
a  series  of  terminal  gueries  reguesting  the  various  input 
parameters  required  in  the  display.  Each  query  is  generated 
based  upon  the  user  response  to  the  previous  query.  The 
general  program  schematic  and  input  requirements  is  outlined 
in  figure  3.1  . 


I  he  first  option  presented  is  that  of  imputing  a  data 
set  (figure  3-3).  Data  which  nas  ceen  previously  copied 
fro  a  another  APL  workspace  may  re  entered  ty  variarie  name. 
Data  whicn  is  located  on  a  CMS  file  can  he  automatically 
read  into  the  workspace.  Data  in  a  CMS  file  can  contain 
only  numeric  characters.  A  mixture  of  numeric  and  alpha¬ 
betic  cnaracters  will  result  in  the  data  not  being  read  in 
correctly.  A  crucial  requirement  is  tnat  regardless  of  how 
the  data  is  entered  it  must  be  in  two  dimensional  array  form 
(rows  and  columns).  The  columns  of  the  data  correspond  to 
the  variables,  the  rows  to  the  different  observations  on 
each  variable. 

Once  the  basic  data  has  been  entered  the  user  is 
presented  with  an  option  to  have  either  all  of  the  data  or 
only  a  subsample  of  the  data  appear  in  the  display.  This 
allows  Draftsman  displays  to  be  produced  on  either  all  the 
data,  specified  variables,  a  subpopulation  of  a  variable,  or 
any  combination  thereof  (figure  3.4  ). 

Based  on  the  data  selected,  an  option  will  be  presented 
to  enter  the  appropriate  names  of  the  variables  which  will 
appear  in  the  display  (figure  3.5  ).  These  names  are  the 
labels  whicn  will  appear  on  the  axis  of  the  plots.  The 
variable  names  can  be  entered  as  a  previously  generated  APL 
two  dimensional  array  of  characters.  If  this  method  of 
input  is  selected,  each  row  of  the  array  must  contain  tae 
name  of  a  variable  in  the  same  order  as  the  variable  is 
located  in  the  data  structure.  Tne  variable  names  may  a^so 
be  entered  directly  in  response  to  a  sequential  series  of 
queries.  Once  the  variable  names  are  entered,  the  minimum 
input  requirements  needed  to  produce  a  Draftsman  display- 
have  been  completed.  The  remainder  of  the  queries  pertain 
to  display  enhancements  which  may  oe  invoked  if  desired. 

The  first  enhancement  option  is  that  of  jittering 
(figure  3.6  ).  An  input  of  0  will  result  in  no  jittering  of 
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the  data.  If  jittering  is  desired,  tne  user  will  he  queried 
as  to  which  variables.  The  results  of  jittering  appear  only 
in  the  Draftsman  display  and  do  not  permanently  alter  any  of 
the  values  in  the  original  data  set. 

The  second  enhancement  option  available  is  transforma¬ 
tion  of  variaoies  (figure  3.6  ).  Here  again,  a  response  of 
0  will  result  in  no  transformations  occuring.  If  one  or 
more  transformations  are  desired  the  user  is  prompted  for 
the  variables  and  API  expression  for  tne  transformation.  A 
summary  for  some  of  the  more  common  transformations  with 
examples  is  illustrated  in  Table  I  . 


TABLE  I 

Sample 

APL  Transformations 

TRANSFORM 

MATH  FORM 

APL  EXPRESSION 

LOG 

LN  X 

•  X 

LINEAR 

Ax  +B 

B+AxX 

CUBIC 

X3 

X*3 

CUBE  ROOF 

. 

X\-1/3) 

SQUARE 

X2 

X*2 

SQUARE  ROOT 

x-i/2 

X*(-  1/2) 

J 

The  Draftsman  program  will  begin  to  display  the  compo¬ 
nent  scatter  plots  on  the  graphics  screen.  The  entire 
display  is  generated  in  segments  of  five  variaoies.  At  the 
end  of  each  segment  an  option  is  offered  for  the  user  to 
quit,  continue,  or  to  make  a  nardcopy  and  continue. 
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Piyure  3.3  Data  Input  Options 


Will  I'll 


17.  DR AFTSHAN  TECHNICAL  IHPLE MENTATION 

A.  EASIC  DRAFTSMAN  ECUTINE 

The  Draftsman  program  was  written  in  API,  which  is  an 
array  processing  language.  The  use  of  APL  enables  the 
Draftsman  program  to  call  and  implement  a  variety  of  plot¬ 
ting  functions  available  in  the  ISM  GRAFSTAT  graphical  soft¬ 
ware  package.  The  GRAFSTAT  software  is  an  experimental 
graphics  program  currently  under  development  by  IBM.  It  is 
presently  available  at  the  Naval  Postgraduate  School  for 
testing  and  evaluation  purposes.  [Ref.  4] 

A  secondary  benefit  derived  in  using  APL  is  the  inherent 
user  efficiency  characteristics  in  terms  of  the  large  number 
of  mathematical  operations  executable  directly  as  keyboard 
entries.  This  approach  is  ideal  for  exploring  data  struc¬ 
tures  and  features  of  interest.  [Bef.  6] 

The  foundation  of  the  Draftsman  program  revolves  around 
the  graphical  plotting  features  of  GRAFSTAT,  and  in  partic¬ 
ular  the  scatter  plot  option.  This  option  requires  the  user 
to  input  the  two  variables  of  interest,  size  and  location  of 
the  plots,  as  well  as  any  headings  desired.  Figure  4.1  is 
an  example  of  the  scatter  plot  input  screen,  called  The 
alphanumeric  screen  (ANS)  in  GRAFSTAT. 

Correct  alignment  of  the  series  of  scatter  plots  so  that 
a  commonality  of  axis  between  adjacent  plots  exist  involves 
an  automatic  reiterative  calling  of  the  scatter  plot 
graphics  function.  Each  input  parameter  of  the  basic 
scatter  plot  function  is  assigned  as  a  local  program  vari¬ 
able.  Eased  upon  the  input  data  structure,  the  Draftsman 
program  sytematically  selects  the  variables  to  be  plotted, 
appropriately  labels  each  axis,  and  determines  the  correct 
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Figure  4.1  GRAFSTAT  Scatter  Plot  Function  Screen. 

location  in  wnich  each  of  the  plots  will  appear  for  display. 
This  methodology  produces  the  entire  array  of  plots  wnile 
eliminating  the  need  for  reiterative  inputs  by  the  user  for 
each  plot.  The  output  is  displayed  as  a  row  of  scatter 
plots  for  eaca  variable  in  the  data  set. 

For  data  structures  consisting  of  five  or  less  vari- 
anles,  the  Draftsman  program  display  will  fit  on  a  single 
page.  The  plotting  of  five  variables  per  page  was  selected 
to  balance  space  limitations  against  tne  need  for  sufficient 
clarity  of  detail  within  the  plots.  To  accomodate  more  toan 
five  variables  on  a  single  page  would  require  smaller  plots 
wnile  reducing  tne  visual  usef ullness  of  the  display  ar.3 
malting  comparisons  inconvenient.  Less  than  five  variarles 
per  page  results  in  the  excessive  use  of  costly  graphic 
reproducing  paper.  For  data  sets  exceeding  five  variables, 
tne  Draftsman  display  is  generated  m  segments  whicn  when 
reproduced  may  oe  pasted  togetner  to  form  a  completed 
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enanles  the  user  to  display  data  sets  consisting  of  more 
than  five  variables.  The  display  procedure  is  limited  only 
by  the  workspace  capacity  of  the  user  computing  facility. 
The  number  of  segments  that  will  result  in  a  Draftsman 
display  can  be  calculated  by  squaring  tne  number  of  vari¬ 
ables  in  the  data  set  and  dividing  by  25.  In  practice,  a 
display  cf  more  than  15  vanaoles  becomes  somewhat  unwieldy 
and  may  negate  the  benefits  of  using  a  Draftsman 
methodology. 

B.  TECHNICAL  DETAILS  OF  INPOT  REQUIREMENTS 

A  two  dimensional  array  of  data  and  a  two  dimensional 
array  of  the  data  variable  names  are  the  minimum  input 
parameters  required  to  generate  a  Draftsman  display.  These 
parameters  are  inputted  as  prompted  by  the  routine  ADMIN. 

Data  may  be  input  directly  as  an  APL  variable  or 

retrieved  from  a  CMS  file  located  on  the  user's  disk.  File 

reading  is  accomplished  by  CMSREAD  [Ref.  6],  a  library 
routine  which  has  teen  pre-copied  into  the  Draftsman 
workspace. 

A  program  entitled  SOB  was  written  to  assist  in  the 
restructuring  of  data  sets  into  more  convenient  formats.  An 
initial  analysis  of  the  basic  Draftsman  display  may  reveal 
certain  variables  or  sections  of  data  points  which  warrant 
closer  scrutiny.  The  SOB  program  allows  the  user  to  select 
variables  from  the  original  data  set  as  well  as  subsamples 

of  a  variable  in  order  to  create  a  new  data  set  entitled 

DATA.  DATA  becomes  the  global  variable  that  is  actually 
displayed.  The  APL  program  SUB  which  implements  this  proce¬ 
dure  is  found  in  Appendix  A. 

The  matrix  of  variable  names  is  either  input  directly  as 
an  APL  two  dimensional  array  or  is  generated  by  the  routine 
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generates  by  1A5ZLS,  eacn  variance  name  entered  containing 
less  tear.  20  characters  is  ^adied  wit a  clanx  spaces.  If 
more  tnan  twenty  characters  are  entered,  only  tr.e  first 


twenty  enaracters  will  appear  on  tne  display. 


.  a  is  assure; 


tr.at  the  entire  array  wnen  passed  to  succeeding  routines  is 
a  valid  rectangular  cr.arac  ter  array.  The  LA3ELS  routine 
wnich  implements  this  procedure  is  found  in  Appendix  A. 

C.  ENHANCEMENT  ROUTINES 
1.  Jitter  Soutine 

As  discussed  in  Chapter  II,  overiarping  of  plotted 
values  nay  be  misleading  and  inadeguately  portray  the  visual 
relationships  exhibited  in  the  data  structure.  The  solution 
is  to  jitter  or  add  random  noise  to  one  or  botn  variables  to 
be  plotted.  This  technique  is  presented  as  an  enhancement 
option  to  the  user  and  requires  only  an  identification  of 
the  variables  upon  which  jittering  will  he  performed. 

The  jittering  of  variaole  points  within  the 
Draftsman  program  is  accomplished  through  a  method  discussed 
ty  Chambers  [Ref.  1  :pp. 106-107].  we  let  Ui,  for  i=1  to  n  ( 
tne  numrer  of  observations  ),  be  n  equally  spaced  values 
from  -1  to  +1  in  random  order.  The  original  variatle  values 
are  thus  reexpressed  in  jittered  form  Ji, 


Ji  = 


(egn  4.1) 


wnere  0x  is  .05  times  the  range  of  tne  variable  data  values. 
This  method  results  in  a  fractional  smft  of  tne  data  values 
along  tne  same  axis  in  wnich  the  vanacle  is  plotted. 


The  snail  shifting  of  plotted  points  is  sufficient 
to  negate  the  effects  of  overlaps  while  preventing  any 
serious  corruption  of  the  plotted  data.  It  shows  the 
multiplicity  of  points  at  each  actual  coordinate.  The  orig¬ 
inal  data  values  can  te  recovered  oy  rounding  to  the  nearest 
integer.  Internally  the  Draftsman  program  uses  the  original 
data  to  create  local  variables  which  are  jittered  and  then 
plotted.  This  enables  the  user  to  always  maintain  the  data 
in  original  form. 

2.  Transforma tion  Rout ine 

The  potential  for  transforming  variables  was  written 
as  a  user  option  to  further  enhance  the  basic  Draftsman 
display.  This  routine  maximiaes  the  characteris tics  of  the 
APL  primitive  functions  as  well  as  parallel  array 
processing.  The  program  requires  the  identification  of  the 
variables  desired  to  te  transformed  and  the  appropriate  APL 
expressions  for  each  transformation.  The  parallel 
processing  capability  transforms  each  variable  set  in  one 
operation.  As  in  the  jitter  routine,  the  transformation 
routine  transforms  only  local  variables  for  plotting  and 
leaves  the  original  data  structure  intact. 


7.  M  ANALYSIS  OF  AUI2I2BILE  DATA 

A.  INTRODUCTION 

An  analysis  is  presented  of  data  consisting  of  selected 
characteristics  of  automoniles  manufactured  during  1933  and 
tested  by  Consumers  Union.  The  primary  purpose  of  tnis 
chapter  is  to  demonstrate  an  application  of  Draftsman 
displays  in  exploratory  data  analysis.  The  analysis 
initially  explores  the  general  descriptive  qualities  of  the 
characteristics  of  automobiles  using  the  nasic  Draftsman 
display  procedures.  Subsequent  analysis  focuses  on  observed 
variables  cf  interest  as  developed  tnrough  the  enhancement 
features  of  Draftsman. 

B.  THE  AUTOMOBILE  DATA 

The  data  was  initially  formatted  as  a  two  dimensional 
array  consisting  of  106  rows  and  14  columns.  Each  row  of 
the  data  matrix  corresponds  to  one  of  the  106  different 
models  of  automoniles  as  tested  by  Consumer  Union  [Ref.  2 
:pp.  3 20-356  ].  The  columns  contain  various  characteristics 
for  each  of  the  automobiles.  Ttiese  fourteen  variables 


comprise  the  three  general  categories  of  price,  performance, 
and  size.  The  price  category  consists  of  tne  suggested 
retail  price  of  the  basic  automobile  witrout  additional 
options.  The  performance  variables  include  fuel  efficiency 
(city  and  highway),  turning  diameter,  gear  ratio,  and 
venicle  repair  records  for  the  two  preceeding  years.  The 
size  variables  consist  of  length,  weigct,  headroom,  rear 
seating  space,  trunk  size,  ana  engine  displacement.  A 
general  variable,  automobile,  corresponus  to  each  of  tne 
specific  models  upon  which  the  data  is  based.  A  summarized 
description  of  the  data  is  snovn  in  Taole  II  for  reference. 


Automobile 

! 

TABLE  II  1 

Data  Characteristics  I 

1 

VARIABLE 

UNITS 

REMARKS 

Automoo! le 

1  to  AS:  small  ears 

Price 

$1003 

AS  to  98  mid  cars 

99  to  106;  large  cars 

MPG  CTty 

HI  les  oer  gallon 

EPA  rated 

WPG  Highway 

Reoaf r  81 

0»  not  rated. 

Reoaf r  82 

1»  very  poor 

Heaaroom 

Irenes 

2-  ooor 

3-  average 

A*  good 

5»  very  good 

Rear  seat  soace 

Incnes 

Trunk  size 

Cuofc  Feet 

Weight 

pounds 

Length 

fnches 

Turning  radius 

Feet 

Engine  dtsolacement 

Cue f  c  f  nenes 

Gear  ratio 

—  J 

C.  PRELIMINARY  ANALYSIS 

1 .  General 

The  general  Draftsman  display  of  variables  was 
generated  as  discussed  in  chapter  II.  A  reduced  version  of 
the  basic  displays  is  seen  in  figures  5.1  through  5.4  .  The 
actual  Draftsman  displays  used  for  analysis  may  be  found  in 
Appendix  B.  For  convenience  and  clarity,  individual  scatter 
plots  from  the  displays  will  be  included  within  applicable 
sections  of  the  text. 
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Figure  5.2  Segment  2  of  Automobile  Data 


Figure  5.4  Segnent  4  of  Automobile  Data 


The  general  ccaracteristics  of  automobiles  are 
likely  to  be  familiar  to  most  readers.  Intuitively  we  car. 
perhaps  surmise  many  of  tne  relationsnx ps  of  the  data  struc¬ 
ture  without  even  locking  at  it.  This  familiarity  however 
will  enable  as  to  concentrate  more  on  tne  features  of  the 
Draftsman  program  in  exploring  the  data.  Additionally,  we 
may  confirm  intuitive  knowledge  or  perhaps  change  seme  of 
our  perspectives  based  upon  the  analysis. 

2-  Characteristics  of  Price 

Focusing  on  price  as  relating  to  the  other  parame¬ 
ters,  the  first  visual  message  imparted  in  figure  5.5,  is 
that  price  bands  deliniate  the  major  categories  of  automo¬ 
biles.  Generally  the  small  sized  cars  are  grouped  at  under 
$10,000  while  midsize  models  are  rather  tightly  grouped 
between  $7,500  and  $12,  000.  If  we  concentrate  on  deviations 
from  this  pattern,  the  outliers  reveal  an  interesting 
feature.  From  each  major  size  category  to  the  next  there  is 
a  substancial  increase  in  the  number  of  outliers  within  the 
categories.  These  outliers  are  predominately  luxury  models 
within  their  respective  categories. 

When  price  and  weight  are  compared,  a  gentle  upward 
sloping  trend  dominates,  denoting  that  price  and  weight  are 
positively  related,  which  is  to  oe  expected  (figure  5.5  ). 
This  relationship  levels  off  at  about  $10,000.  A  very 
obvious  branch  from  the  main  trunk  of  observations  shows 
price  increasing  relative  to  weight  at  a  greater  rate.  The 
presence  of  this  branch  suggested  additional  research  to 
determine  if  a  significant  parameter  was  missing  from  the 
data.  The  research  revealed  this  uppermost  branch  consisted 
of  luxury  style  models,  with  all  but  one  of  foreign  manufac¬ 
ture.  The  majority  of  the  outliers  contained  between  the 
two  branches  are  the  luxury  models  of  American  origin.  We 
might  conclude  that  weight  is  generally  associated  witn  an 
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Figure  5.5  Characteristics  of  Price. 

increase  in  price,  with  the  foreign  luxury  models  tending  to 
be  more  expensive  than  American  luxury  models  of  comparable 
weight. 

Similiar  upward  curving  r elati onsnips  as  that 
observed  in  the  scatter  plot  of  price  versus  weight  can  be 
seen  in  some  of  tne  other  plots  in  the  price  row  in  figure 
5.5  .  The  plot  of  price  and  weignt  is  most  closely  resem¬ 
bled  by  that  of  price  and  length.  Tms  however  may  be  some¬ 
what  deceiving.  A  little  thougnt  might  lead  us  to  conclude 
tr.at  these  similarities  are  more  due  to  a  relationship 
between  length  and  weight.  Consistent  with  tnis  are  the  two 
plots  containing  the  parameters  of  rear  seating  space  and 
trunk  size  versus  price.  Althougn  they  loosely  resemcle  the 
pattern  of  the  price  versus  weight  plot,  we  should  suspect 
that  they  are  influenced  more  by  the  overall  dimension  of 
automobile  length.  These  plots  demonstrate  the  care  that 
must  be  taken  in  the  analysis  of  single  scatter  plots.  We 
must  be  cautious  since  each  scatter  plot  in  the  array 
denotes  only  tne  isolated  relationship  of  two  variables  and 
may  not  necessarily  indicate  a  causal  r  ela  tionshi  p. 
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In  order  to  eliminate  th 
discrete  valued  maintenance  ratings. 
If  we  look  at  price  with  respect  to  t 
of  maintainability  in  figure  5.5,  we 
majority  of  the  vehicles  rated  co 
Vehicles  on  wnich  no  maintenance  rec 
denoted  by  a  0.  Significant  of  the 
they  are  very  evenly  distributed  acr 
tenance  scores.  This  suggests  that 
insure  the  maint ainability  of  an  auto 

3.  Characterist ics  of  Size 

As  an  overall  expression  of  s 
show  a  fairly  tight  linear  relati 
figure  5.6  .  Comparing  these  two 

respective  rows  indicates  that  they 
similiar  relationships  with  the  ot 
respect  to  the  size  variables  of  head 
trunk  space,  tighter  relationships 
Weight  on  the  other  hand  has  a  tighte 
engine  disp lacement  parameter  {see  fi 

Bear  seating  and  trunk  siz 
increasing  relative  to  length.  This 
might  expect  since  a  longer  external 
result  in  larger  internal  size  featu 
is  the  factor  of  headspace,  which  de 
trend  of  the  other  internal  size  di 
length  increases  in  figure  5.6,  the 
headspace  to  encompass  a  widening  r 
broadest  range  is  at  106  inches  wher 
1  to  h  inches.  As  vehicles  become 
space  sizes  shifts  to  the  larger  v 
being  limited  in  range  from  3  to  5  in 
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Figure  5.6  Size  and  Internal  Dimensions. 


The  tendency  for  increased  car  weight  to  be  associ¬ 
ated  with  a  related  increase  in  engine  displacement  is 
another  observation  we  would  expect,  and  is  seen  in  figure 
5.7  .  Cf  significance,  is  that  displacement  versus  weight 
fall  into  two  distinct  types  of  groups.  Vehicles  weighing 
up  to  3200  pounds  have  a  very  tight  increasing  linear  rela¬ 
tionship  with  engine  displacements  up  to  175  cubic  inches. 
The  vehicles  of  larger  weight  capacity  are  seen  to  be  asso¬ 
ciated  with  larger  engine  displacements  albeit  with  a  more 
dispersed  cluster  of  points. 

Engine  displacement  in  turn  can  be  seen  to  have  a 
definite  correspondence  to  the  overall  automobile  catego¬ 
ries.  A  close  look  at  the  second  plot  in  figure  5.7  reveals 
that  small  cars  tend  to  be  banded  with  engine  displacement 
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Figure  5.7  Size,  Displacement  and  Vehicle  Hodel. 
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4. 


Vehicle  Per f oraance 

Consumers  should  nave  a  part icular  interest  in  the 
fuel  efficiency  characte ristics  of  automobiles.  Not 
surprising  is  the  trend  snown  in  figure  5.8  that  correlates 
fuel  efficiency  in  the  city  wita  fuel  efficiency  on  the 
highway.  A  comparison  of  the  remaining  variable  plots  for 
these  efficiency  parameters  shows  identical  relationships  in 
all  cases.  The  original  data  structure  could  probably 
exclude  one  of  these  fuel  efficiency  variables  without  loss 
of  information  if  we  needed  to  condense  the  data. 
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Figure  5.8  Fuel  Efficiency,  Weight,  and  Displacement. 

The  inverse  relationship  of  fuel  efficiency  to 
vehicle  weight  should  not  be  unexpected  and  confirms  our 
intuition  in  this  regard  {figure  5.8).  Hign  fuel  effi¬ 
ciency,  low  weight,  and  smaller  engine  displacements  are  all 
associated. 

As  previously  mentioned,  price  alone  is  not  an  indi¬ 
cator  of  automobile  maintainability.  An  interesting  obser¬ 
vation  however  can  be  drawn  from  the  relationship  exhibited 
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between  repair  record  from  one  year  to  toe  next.  The  pi 
or  Repair  81  versus  Repair  82  in  figure  5.9  indicates  a 
strong  positive  correlation  between  tne  two.  In  alxost  all 
instances  the  maintainability  does  not  change  for  better  or 
worse  by  more  than  one  level.  Furthermore,  tne  number  of 
automobiles  tnat  improved,  deteriorated,  or  did  not  change 
in  terms  of  maintainability  are  approximately  e^ual. 
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Figure  5.9  Haintainability  of  Automobiles. 

Repair  when  compared  to  engine  displacement  reveals 
that  the  predominant  number  of  rated  vehicles  (i.e  vehicles 
for  which  repair  data  was  available)  contained  the  smaller 
displacement  engines.  This  concentration  of  better  mainte¬ 
nance  values  at  the  lower  displacement  level  .  suggests  that 
smaller  engines  have  better  maintenance  records. 

D.  ANALYSIS  WITH  ENHANCED  DISPLAY 
1 .  General 

The  analysis  of  the  basic  draftsman  display  revealed 
a  wealth  of  features  pertaining  to  the  individual  variables 
within  the  data.  One  distinct  feature  evident  is  the  poten¬ 
tial  relationship  between  foreign  and  American  manufactured 
automobiles.  While  not  an  original  parameter  of  the  data, 
the  plots  of  price  versus  weight  and  price  versus  model 
indicates  that  this  influence  warrants  closer  scrutiny. 
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In  the  basic  display  the  overlapping  of  maintenance 
values  was  alleviated  with  jittering.  The  array  of  plots 
dealing  with  headroom  space  suggests  that  this  variable  also 
should  ne  treated  likewise.  The  remaining  automobile  cnar- 
acteristics  do  not  suffer  from  any  significant  problems  with 
overlapping  values. 

Eased  upon  the  preliminary  analysis,  an  ennanced 
display  was  generated  for  subsequent  evaluation.  The  redun¬ 
dancy  of  the  two  fuel  efficiency  variables  was  resolved  by 
eliminating  miles  per  gallon  on  the  highway.  The  remaining 
variables  were  reordered  to  place  those  with  similiar  rela¬ 
tionships  in  closer  proximity.  The  enhanced  display  also 
introduces  a  new  discrete  category  variable,  location  of 
manufacture.  A  value  of  1  for  this  variable  corresponds  to 
those  automobiles  produced  in  America,  those  vehicles 
produced  overseas  under  an  American  brand  name  are  denoted 
with  a  2,  while  foreign  models  are  assigned  a  3. 

The  introduction  of  location  of  manufacture  dramati¬ 
cally  portrays  some  very  evident  dichotomies  which  exist 
between  foreign  and  American  made  automobiles.  In  general, 
the  array  of  plots  consisting  of  these  parameters  indicates 
a  very  different  orientation  on  the  part  of  the  respective 
manufactures  in  their  approach  to  the  automobile  market. 

The  potential  for  transforming  the  data  through 
transformations  was  considered.  Transforming  engine 
displacement  with  a  log  transform  slightly  straightens  the 
plots  containing  this  variable  with  respect  to  some  of  the 
other  size  parameters  as  seen  in  figure  5.10  .  This  reex¬ 
pression  however  does  not  really  enhance  the  description  of 
the  data  and  hence  was  not  included  in  the  final  display. 

The  complete  enhanced  draftsman  display  may  be  found 
in  Appendix  B.  Isolated  portions  of  this  display  will  be 
reproduced  within  this  section  of  the  text  for  clarity. 
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Figure  5.  10  Log  Transformation  of  Engine  Displacement. 

2 .  Price 

The  majority  of  American  and  foreign  cars  are  prima¬ 
rily  priced  below  $12,000.  Extremely  visible  is  the  large 
grouping  of  American  models  in  the  neignbornood  of  $9000  to 
$11000  as  depicted  in  figure  5.11  .  American  model  prices 
beyond  this  level  tend  to  increase  in  a  ratner  uniform 
fashion  of  $2000  increments,  up  to  $24000.  In  contrast, 
foreign  car  prices  are  fairly  uniformly  distributed  in  the 
region  of  $5000  to  $15000  with  subseguent  price  hikes  in 
larger  increments  of  $5000,  to  the  maximum  level  of  $35000. 


Figure  5.11  Location  of  Manufacture  and  Price. 


3.  Size 


In  general,  automoniles  of  American  ana  forei. 
origin  fall  within  two  distinct  sire  ranges  (figure  5.12) 


Iii  terns  of  tne  major  dimensions  of  lengtn  and  neijut,  toe 
plot  arrays  provide  some  distinguishing  features  contrasting 
location  of  manufacture.  Not  very  surprising  is  t..at 
American  vehicles  tend  to  the  longer  and  heavier  side  vr.ile 
foreign  manufactured  cars  tend  to  be  snorter  and  lignter. 


Figure  5.12  location  of  Manufacture  and  Size. 


In  view  of  the  propensity  for  foreign  cars  to  se 
snorter  than  the  American  counterparts,  the  plots  of  the 
related  inner  size  characteristics  snown  in  figure  5.12  is 
somewhat  unexpected.  An  evaluation  of  rear  seating  space, 
trunk  size,  and  headroom  shows  that  tne  distribution  or 
values  for  foreign  produced  venicles  is  sligntiy  shifted  to 
the  smaller  dimensions  in  contrast  to  the  respective 
American  distributed  values.  This  is  consistent  with  obser¬ 
vations  noted  in  the  casic  display.  What  is  unexpected,  is 
that  the  differences  based  upon  the  saifts  is  much  smaller 
tnan  we  might  expect  given  the  prevalent  difference  in 
length  distributions  between  American  and  foreign  cars. 

In  conclusion,  the  rear  seat  spaciousness  of 
American  cars  is  fairly  evenly  districted  between  25  and  30 


inches  with  the  large  models  being  outliers  at  32  incr.es. 
The  foreign  models  are  more  widely  dispersed  from  20  to  25 
inches.  Thus  at  the  upper  end  of  tne  spaciousness  scale 
there  is  actually  only  a  three  inch  advantage  by  the  largest 
of  the  k  crican  models. 

The  characteristic  of  headroom  denotes  a  similiar 
relationship  as  that  observed  for  rear  seating  space  (figure 
5.12).  Again,  approximately  50%  of  tne  foreign  car  headroom 
values  fall  within  the  same  distribution  range  (2  1/2  to  4 
1/4  inches)  as  that  of  tne  vast  majority  of  the  American 
models. 

The  differences  between  trunx.  sizes  seen  in  figure 
5.12  is  a  bit  more  acute  in  that  the  foreign  models  range 
from  10  to  14  cubic  feet,  while  the  American  cars  are  skewed 
toward  12  to  14  cubic  feet.  Clearly,  in  spite  of  the 
distinct  length  differences  between  foreign  and  American 
produced  cars,  the  differences  in  internal  dimensions  is 
much  subtler  and  smaller  than  we  might  have  originally 
suspected.  The  foreign  cars,  althougn  smaller  in  overall 
length  ,  have  approximately  the  same  internal  size  features 
as  all  but  the  very  largest  American  made  cars.  It  is  also 
interesting  to  note  that  the  American  sponsored  nut  produced 
overseas  models  tend  to  exhibit  tne  characteristics  of  the 
foreign  models. 

4 .  Performance 

The  distribution  of  the  fuel  efficiency  characteris¬ 
tics  of  American  and  foreign  automoniies  appears  to  be  tne 
inverse  of  their  weight  (figure  5.13  ).  Ihe  heavier 
American  cars  tend  to  be  evenly  distribute;,  between  9  and  20 
mpg  with  only  three  outliers  extending  beyond  25  mpg.  The 
lignter  foreign  cars,  while  ranging  from  15  to  28  mpg,  are 
rather  tightly  grouped  between  20  and  25  mpg.  The  outlier 


fuel  efficiency,  the  foreign  venicits  certainly  dominate 
this  attribute  of  performance. 

Perhaps  the  most  revealing  plots  in  tae  ennanced 
display  are  tnose  of  the  repair  records.  In  rota  recorded 
years  the  American  models  are  rather  evenly  iistnruted  from 
poorer  than  average  (1)  to  average  (3)  maintainability 
ratings.  A  batter  than  average  rating  (4)  was  achieved  only 
four  times  over  both  years.  In  extreme  contrast,  the 
foreign  models  during  both  years  show  a  tendency  toward  the 
much  better  tnan  average  ma int a inaoil i ty  rating  (5).  As  in 
the  characterist ic  of  fuel  efficiency,  the  foreign  models 
dominate  this  performance  variable  of  maintamanility. 
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Figure  5.13  Location  of  Manufacture  and  Performance. 


E.  CCMC  ID  SION  S 

The  car  data  is  an  excellent  example  of  how  the 
Draftsman  display  can  be  used  to  describe  a  data  set.  The 
various  parameters  associated  wita  automobiles  can  be  very 
confusing  to  the  consumer.  No  one  single  parameter  can  be 
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.selected  as  an  overall  measure  of  wnat  constitutes  a  "lest" 
automobile.  What  one  consumer  may  find  desiraDle,  another 
consumer  may  find  unacceptable.  Thus,  describing  or 
modeling  the  data  with  more  formal  statistical  tecnniques 
such  as  linear  regression  is  not  very  applicable.  The 
Draftsman  display  enables  the  user  to  observe  tne  multivar¬ 
iate  affects  of  each  of  the  various  parameters.  3ased  upon 
the  selection  of  one  or  more  parameters,  the  user  can  deter¬ 
mine  the  impact  relative  to  other  parameters. 
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VI.  AN  ANALYS IS  OF  CONTRACT  DATA 

A.  INTRODUCTION 

A  graphical  analysis  is  presented  using  tne  Draftsman 
display  on  data  collected  concerning  selected  Naval 
contracts  signed  during  the  period  1949  througn  1963.  This 
chapter  explores  the  general  descriptive  dualities  of  eleven 
categories  of  contractual  information  relative  to  the 
performance  of  the  contracts. 

The  data  originally  was  analysed  in  a  Thesis  completed 
in  1373  [Bef.  7],  tnrough  regression  and  analysis  of  vari¬ 
ance  technigues.  Significant  in  this  study  was  the  conclu¬ 
sion  that  there  was  no  clear  method  of  describing  the 
rela tionships  between  contract  parameters  and  the  subsequent 
performance  of  tne  contracts.  It  is  this  authors  opinion 
that  the  analysis  failed  because  the  use  of  linear  regres¬ 
sion  alone  is  not  sufficient  to  adequately  describe  tne 
relationships  present  in  tne  data.  The  analysis  presented 
based  upon  a  Draftsman  display  suggests  that  this  method  of 
exploratory  data  analysis  reveals  a  variety  of  relationships 
do  exist  describing  contract  performance  relative  to  tne 
contractual  parameters. 

E.  THE  CONTRACT  DATA 

The  data  consist  of  177  contracts  which  comprise  ail 
Naval  aircraft  and  missile  fixed-price  incentive  contracts 
completed  during  the  period  1949  tarougn  1963.  The  data  as 
provided  by  the  Naval  Material  Command  encompasses  11  param¬ 
eters  as  follows : 

1.  Deviation  from  target  cost  (percent). 

2.  Months  to  complete  contract  (months). 
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TABLE  III 

Description  of  Variable  Coding 

Codes  for  variable  9  Codes  for  variable  11 

YEAR  SIGNED  SYSTEM  TYPE 


1  -  1949 

2  -  1950 

15-  1963 


Codes  for  variable  10 
MANUFACTURER 


1 

Beecb 

7 

Hiller 

2 

LTV 

a 

Kaman 

3 

Corva  1  r 

9 

Martin 

4 

Douglas 

10 

McDonald 

5 

Boeing 

11 

N.  American 

6 

Grumman 

12 

Vertol 

1  Utility  Airplane 

2  Combat  Airplane 

3  Missile 

4  Blimp 

5  Helicopter 

6  Drone 

7  Airborne  Equipment 


13 

Ryan 

19 

Phllco 

14 

Sikorsky 

20 

Maxson 

15 

Bell 

21 

NortbroD 

16 

Lockeed 

22 

Ra  f'tbeon 

17 

Bendl  x 

23 

Aerojet 

18 

Gen  Elect. 

C-  THEORY  OF  FIXED-PRICE  INCENTIVE  CONTRACTS 


The  concept  behind  fixed-price  incentive  (F?I)  contracts 
is  that  they  are  intended  to  be  used  in  the  development. 


management  support,  a  n d  pro:  JCtur.  at  items  i:  wr.icr.  toe 
uncertainty  of  cost  is  too  great  to  allow  a  fir 3- fixed  price 
(???)  contract  attractive  to  bidders.  In  theory,  tne  F?r 
contract  should  activate  control  of  costs  by  rewarding  the 
manufacturer  with  a  greater  profit  level  as  costs  are 
reduced  below  the  negotiated  target  cost. 

The  incentive  feature  of  tne  F  ?I  contract  scould  influ¬ 
ence  the  contractors  to  affectively  manage  cost  associated 
decisions  in  a  manner  beneficial  to  profit.  This  in  turn 
sr.ould  result  in  a  favorable  cost  outcome  to  the  government 
as  well.  This  mutually  favoranle  outcome  is  communicated  in 
the  form  of  the  sharing  ratio  whicn  estaniisnes  the  amount 
of  money  whicn  will  be  returned  to  the  contractor  for  every 
dollar  saved  below  the  target  cost  of  the  program.  For 
example,  a  75/25  sharing  catio  returns  25%  of  every  dollar 
saved  to  the  contractor  while  reducing  the  governments 
expected  cost  tv  .75  dollars.  Tne  Higher  the  percent 
returned,  the  greater  the  potential  for  gain  or  lose  to  the 
contractor.  Hence,  lower  sharing  ratios  reflect  a  greater 
degree  of  financial  risk  to  the  contractor. 

The  ceiiiag  pcice  of  a  contract  is  a  control  measure  to 
avoid  excessive  cost  overruns  to  tae  government.  Tae 
ceiling  price  establishes  tne  maximum  amount  of  cost  wnich 
will  be  paid  cy  the  government.  Waea  final  cost  exceeds  ..he 
ceiling  cost,  the  difference  must  be  Dome  out  of  pocnet  by 
the  contractor  as  a  loss.  Cost  outcomes  which  fall  between 
the  negotiated  target  and  ceiling  values  result  in  a  break 
even  venture  to  tne  manufacturer. 

D.  PEELIHI NAHY  ANALYSIS 

1 .  General 

A  Draftsman  display  of  the  clever,  contractual  param¬ 
eters  was  generated  for  preliminary  analysis.  host 
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noticeable  was  tnat  nine  of  the  eleven  variables  consisted 
of  discrete  values  which  resulted  in  substantial  overlapping 
of  plotted  points  throughout  tne  display.  The  plots 
containing  the  parameter  of  number  of  items  produced  indi¬ 
cates  a  pronlem  in  scaling.  Contracts  range  in  size  from  40 
to  1400  items.  This  problem  as  shown  in  figure  1.1  ,  is 
caused  by  the  eignt  extrema  outlier  contracts  containing  in 
excess  of  801  items.  The  compression  of  the  remaining 
majority  of  the  contracts  into  a  very  small  segment  of  the 
plats  would  prevent  otserva tions  of  any  meaningful  value. 


Figure  1.1  Number  of  Items  per  Contract. 


A  subsequent  Draftsman  display  was  generated  to 
alleviate  the  proDlem  of  overlap  as  well  as  the  scaling  of 
number  of  items  per  contract.  To  take  care  of  the  overlap 
problem  all  variables  except  deviation  from  target  cost, 
target  cost,  number  of  items,  and  manufacturer  were 
jittered.  To  take  care  of  the  scaling  problem,  a  log  trans¬ 
formation  was  used  on  the  variable  of  number  of  items. 

The  Draftsman  segments  generated  with  enhancements 
were  reduced  and  are  shown  in  figures  1.2  through  1.5  .  For 
convenience  and  clarity  of  discussion  appropriate  plots  will 
he  reproduced  within  the  body  of  tne  cnapter  text.  The 
original  dispLay  segments  may  be  seen  in  Appendix  C. 
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i.gure  6.2  Draftsman  Segment  1,  Contract  Data 


Figure  6.  3  Draftsman  Segment  2,  Contract  Data 


Figure  6.4  Draftsman  Segment  3,  Contract  Data 


Figure  6.  5  Draftsman  Segment  4,  Contract  Data 


2.  Characteristics  of  Size 


a.  Volume  of  Contracts 


As  a  preliminary  expression  of  size,  the  number 
of  contracts  per  year  provides  an  estimate  for  tne  volume  of 
contracts  let  during  a  particular  period  of  time.  we  might 
suspect  that  a  low  volume  of  contracts  would  create  a  more 
competitive  atmosphere  among  manufacturers  as  they  attempt 
to  maintain  their  facilities  in  a  production  mode.  A  high 
volume  of  contracts  let  offers  a  greater  opportunity  for 
manufacturers  to  select  contracts  in  which  they  have  the 
greatest  amount  of  expertise  and  experience.  The  latter 
case  has  a  greater  potential  for  controlling  costs  as  well 
as  manufacturer  profits.  Thus,  we  might  expect  that  as  the 
number  of  contracts  let  increasess,  the  positive  deviations 
from  target  cost  should  decrease. 

The  plot  of  cost  deviation  versus  number  of 
contracts  seen  in  the  first  plot  of  figure  6.6  does  appear 
to  generally  support  this  hypothesis.  As  the  volume  of 
contracts  increases  there  is  a  tendency  for  cost  deviation 
to  be  negative.  In  fact,  the  greater  the  volume,  the 
greater  in  magnitude  the  negative  cost  deviation. 

The  cost  deviations  versus  volume  relationship, 
when  compared  over  time,  also  suggest  that  the  time  at  which 
the  contracts  were  signed  may  have  additional  bearing  (see 
figure  6.6).  The  rapid  increase  in  contract  volume  from 
1949  to  1951  is  characterized  by  large  absolute  deviation 
from  target  cost  (though  generally  negative).  As  volume 
declined  from  1951  through  1955,  the  absolute  deviations 
from  target  cost  can  be  seen  to  be  much  smaller  and  roughly 
equally  distributed  between  positive  and  negative.  The 
subsequent  volume  increase  experienced  from  1955  to  1958 
also  shows  a  increase  in  the  absolute  deviations  from  target 
cost  (with  a  fair  tendency  toward  negative  deviations).  The 


volume  incline  from  1958  to  1964  is  soaewnat  more  difficult 
to  interpret  due  to  the  relatively  low  volume  of  contracts. 
Absolute  deviations  do  appear  to  cecoae  slightly  smaller  as 
volume  decreases.  However,  when  contrasted  to  previous 
years  of  similiar  volume,  the  deviations  while  becoming 
smaller,  appear  to  he  doing  so  to  a  xesser  degree  than 
previously.  The  last  three  years  of  the  data  period,  while 
characterized  both  by  low  volume  as  well  as  a  small  aosolute 
deviation  from  target  costs,  clearly  shows  a  tendency 
towards  positive  cost  deviation. 
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Figure  6.6  Contract  Volume. 


b.  Contract  Duration 

Based  upon  production  aanageaent  tecntigues,  we 
mignt  expect  that  the  duration  of  a  contract  would  have  a 
relationship  to  contract  performance.  Short  term  contracts 
leave  little  time  for  management  to  adjust  production  activ¬ 
ities  to  maximize  the  efficiency  of  operations.  As  contract 
duration  increases,  a  greater  opportunity  is  afforded  to 


contractors  to  learn  by  early  production  errors  and  make  tne 
cost  related  decisions  necessary  for  control.  x-er. 
contracts  duration  extends  far  into  tne  future,  difficulties 
can  arise  by  external  economic  influences  which  could  not  be 
accurately  forecast  at  the  onset. 


Figure  6.7  Contract  Duration  and  Performance. 

The  isolated  plot  of  deviations  from  cost  rela¬ 
tive  to  contract  duration  reproduced  in  figure  6.7  reveals 
some  interesting  features.  The  contracts  of  less  than  40 
months  duration  exhibit  widely  dispersed  deviations  from 
target  cost.  For  the  contracts  which  lasted  between  40  and 
70  months  tr.e  cost  deviations  exnibit  a  clear  trend  toward 
the  negative  side.  Contracts  waicn  exceed  70  montns  in 
duration  shew  an  increased  deviation  that  is  roughly  equally 
split  between  positive  and  negative  . 

The  cost  deviation  characteristics  relative  to 
duration  noted  above  appear  to  nold  irrespective  of  the  year 
in  which  the  contracts  were  signed  (see  figure  6.7  ). 
Contracts  of  less  than  40  months  duration  as  well  as  those 
between  40  and  70  months  are  fairly  equally  distributed 


across  all  years  of  the  data  period.  Whetner  contracts 
which  exceed  70  months  in  duration  are  effected  by  the  year 
signed  is  not  determinable  since  ail  tut  one  of  these 
contracts  occured  during  the  first  five  years. 

c.  Target  Cost  of  Contracts 

The  target  cost  parameter  provides  a  measure  of 
the  financial  size  of  the  contracts  let.  The  plot  of  this 
variable  with  respect  to  deviation  from  target  cost  is  shown 
in  figure  6.9  .  The  greatest  absolute  deviation  from  target 
cost  can  be  observed  when  target  cost  is  less  than  100 
million  dollars.  In  this  region/  tne  deviations  tend  to  be 
negative  but  only  by  a  slight  numerical  margin.  The 
contracts  which  exceeded  a  target  cost  of  100  million 
dollars  are  clearly  seen  to  exhibit  a  smaller  absolute  devi¬ 
ation  from  the  target  cost.  These  contracts  are  further 
char acterized  by  generally  favoring  a  negative  cost 
deviation. 


Figure  6.8  Target  Cost  and  Performance. 

3.  Incentive  Me asur es 
a.  Sharing  fiatio 

The  sharing  catio  establishes  the  amount  of 
money  wnich  will  be  returned  to  the  contractor  for  every 
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dollar  saved  in  cost.  This  return  reflects  tne  potential 
for  profit  to  the  manufacturer.  Tne  r.  1  g r. er  the  sharing 
ratio,  the  aigher  the  potential  gain.  This  r eiatior.ship 
appears  to  le  echoed  in  the  plot  or  tnese  two  variables  as 
seen  in  figure  6.9  .  As  the  sharing  ratio  increases  so  does 
the  relative  expected  profit  level  of  tne  lanufacturer.  Tne 
risk  factor  associated  with  contract  duration  can  also  be 
onserved  in  the  sharing  ratio.  As  contract  duration 
increases  the  potential  for  influence  by  otner  external 
economic  parameters  can  less  accurately  be  forecasted.  The 
general  decline  of  sharing  ratios  as  contract  duration 
increases  can  be  seen  in  figure  6.9  .  This  decline  may  be  a 
sign  of  the  contractor's  willingness  to  accept  a  lower 
maginal  profit  position  in  order  to  decrease  risk. 
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Figure  6. 9  The  Incentive  of  Sharing  Ratios. 

The  most  striking  observation  about  the  sharing 
ratio  is  that  when  evaluated  with  performance,  little  if  any 
relationship  can  be  determined.  We  can  see  no  clear  indica¬ 
tion  that  any  particular  ratio  can  oe  associated  with  a 
favorable  (negative)  cost  deviation.  This  lack  of  relation¬ 
ship  is  significant  in  that  the  sharing  ratio  is  supposedly 
a  major  incentive  feature  of  fixed-price  incentive 
contracts.  A  determination  that  snaring  ratios  are  an 
insignificant  parameter  suggests  that  this  method  of 
contracting  mignt  warrant  further  analysis  by  tne 
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government.  It  may  :e  ir.terestir.j  to  note  tnat  a  Rand  study 
of  over  400  Air  Force  contracts  resulted  in  a  similiar 
finding  that  tne  sharing  ratio  was  insignificant  with 
respect  to  tha  final  cutcoae  of  contracts  [Ref.  8  p .38]. 

t.  Target  Profit 

Tae  plots  of  negotiated  target  profit  level  in 
figure  6.10  raveal  similiar  character istics  wnen  compared  to 
the  contract  size  parameters.  Very  evident  is  that  target 
profit  in  general  tends  to  revolve  around  the  9%  level. 
Given  the  time  period  during  vhicn  these  contracts  were 
performed,  9%  represents  a  ratner  lucrative  profit  level. 


Figure  6.10  Target  Profit  and  Size. 

Profit  versus  target  cost  is  observed  to  be 
funnel  shaped  with  the  greatest  deviations  in  profit  at  the 
lower  end  of  target  cost.  As  target  cost  increases  the 
variance  of  profit  becomes  smaller  while  stabilizing  about 
tne  9%  level  (figure  6.10).  A  similiar  tnougn  looser  rela¬ 
tionship  exists  with  the  number  of  items  produced.  Target 
profit  deviates  imperceptibly  more  initially  and  generally 


tends  to  fluctuate  about  2  percentage  joints  above  and  below 
tne  9K  level.  The  9?  profit  level  remains  firm  regardless 
of  the  number  of  items. 

Target  profit  plotted  with  duration  and  volume 
shows  a  slight  decline  in  expected  profit  levels  as  either 
duration  or  volume  increase  as  depicted  m  figure  6.11  .  in 
tne  former  situation  this  may  be  a  willingness  to  trade-off 
a  lower  profit  level  for  the  security  of  a  longer  term 
production  operation.  The  inverse  relationship  between 
profit  and  volume  suggests  that  during  low  volume  periods 
contractors  are  attempting  to  make  the  most  out  of  the  few 
contracts  available.  Conversely  when  contract  volume  is 
high  the  expected  profit  level  declines  to  about  9%.  The 
conclusion  might  be  drawn  that  witn  more  contracts  avail¬ 
able,  the  contractors  are  willing  to  accept  a  slightly  lower 
profit  level  per  contract  since  the  opportunity  is  greater 
to  win  multiple  contracts  during  hign  volume  periods. 

As  a  measure  of  performance  the  target  profit 
may  lack  significance  in  determining  the  deviation  from 
target  cost.  The  plot  of  target  profit  versus  deviation 
from  target  cost  shown  in  figure  6.  11  does  not  suggest  a 
describable  relationship.  The  majority  of  tne  manufacturers 
tended  to  negotiate  about  a  9%  profit  level.  An  anlysis  of 
the  eight  most  deviant  outliers  from  this  cnaracteristic 
reveals  that  seven  of  these  outliers  were  by  manufacturers 
witn  these  contracts  being  their  sole  participation  during 
the  entire  15  year  period.  The  comparison  of  target  profit, 
deviation  from  target  cost  and  system  type  is  also  signifi¬ 
cant  with  respect  to  the  outliers.  wniie  there  is  notning 
notable  about  their  target  cost,  the  eleven  most  unfavorable 
cost  outcomes  (positive  cos  t  deviations)  correspond  solely 
to  three  system  types.  These  are  combat  aircrafts, 
missiles,  and  Helicopters  denoted  by  item  types  2,  3,  and  5 
respectively  in  figure  6.11  . 


Figure  6.11  Target  Profit  and  Performance. 


E.  PBELIMI NABY  CONCLUSIOHS 

The  analysis  and  discussion  presented  reveals  that  an 
abundant  amount  of  information  is  visinle  in  tne  Draftsman 
display  of  the  contract  data.  Further,  there  are  indica¬ 
tions  tnat  taere  are  relationships  between  the  contractual 
parameters  and  contractor  performance.  Briefly  summarized 
these  are: 

1.  As  the  volume  of  contracts  let  increases  there  is  a 
tendency  for  the  contracts  to  result  in  a  negative 
cost  deviation  (favoraole  to  tne  government). 

2.  Over  tne  15  year  period  as  volume  caanged  from  year 
to  year  there  appears  to  be  a  related  reaction  rela¬ 
tive  to  contractor  performance.  Periods  denoted  by 
an  increasing  volume  are  reflected  with  an  increase 
in  cost  deviations.  When  volume  declines  a  related 
decline  in  cost  deviations  can  also  oe  observed  but 
at  a  more  cautious  rate. 

3.  Contract  duration  as  related  to  cost  deviation  might 
better  be  described  in  terms  of  snort,  medium,  and 


icng  term  contracts-  me  most  favorarie  contract 
duration  appears  to  re  between  -  0  ana  70  months. 
Inis  relationship  is  fairly  consistent  regardless  of 
the  year  in  which  the  contracts  were  signed  or  tre 
volume  of  contracts  let. 

4.  Fluctuations  in  cost  deviations  tends  to  staniize 
when  the  contracts  contain  more  than  50  items. 

5.  As  the  target  cost  increases  tnere  is  a  greater 
tendency  for  negative  cost  deviation  to  occur. 
Contracts  in  excess  of  100  million  dollars  in  partic¬ 
ular  resulted  ic  predominantly  favoraoie  outcomes. 

6.  The  snaring  ratio  as  a  traditional  incentive  measure 
of  a  FPI  contract  may  lack  merit.  No  relationship 
can  he  observed  between  this  parameter  and  perform¬ 
ance. 

7.  No  obvious  relationship  can  ce  noted  between  target 
profit  levels  and  contract  performance. 

3.  The  ultra  nigh  technology  systems  of  comoat  aircraft, 
helicopters  and  missiles  exnibit  t ne  greatest  poten¬ 
tial  for  adverse  performance. 

F.  ADDITIONAL  CONFIRMATORY  ANALYSIS 
1.  General 

The  preliminary  analysis  using  a  single  iteration  of 
tne  Draftman  display  revealed  a  variety  of  interesting  rela¬ 
tionships  between  contractu  ax  parameters.  Certainly  other 
relationships  exist  which  have  not  been  discussed.  As  an 
exploratory  data  analysis  tool  the  Draftsman  display  enables 
the  user  to  look  at  the  data  at  almost  any  level  of  detail 
desired.  Suoseguent  displays  can  be  generated  on  various 
subpopulations  such  as  eacn  of  tne  manufactures  to  gam 
greater  insignt  to  their  performance  oenavior.  It  is  this 
versatility  in  exploring  data  sets  wmcn  enables  tne  user  to 
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rapidly  process  large  amounts  or  data  ir.  order  to 
feeling  for  tne  interactions  involved. 

The  use  of  the  Draftsman  display  car.  also  assist  the 
user  in  the  application  or  more  formal  statistical 
approaches.  The  use  of  such  techni ^ ue s  as  regression  anal¬ 
ysis  provides  a  confirmatory  measure  to  tne  exploratory 
indications  viewed  in  the  display. 

The  blind  application  of  some  statistical  packages 
without  first  looking  at  the  data  can  result  ir.  erroneous  or 
misleading  conclusions.  This  is  particularly  true  of  large 
data  sets  where  misplaced  decimal  points,  formating  errors 
and  other  related  problems  may  not  be  easy  to  detect.  Tne 
visual  nature  of  the  Craftsman  display  car.  assist  in  identi¬ 
fying  tr.ese  problems  as  well  as  aid  in  selecting  appropriate 
variable  selections  on  which  to  initiate  formal  analysis. 

2.  Cost  Devia  tion  Over  Ti me 

One  major  question  which  cannot  be  readily  answered 
with  the  Draftman  display  of  contract  data  is  the  relation¬ 
ship  of  cost  deviations  over  the  entire  contract  period. 
The  scatter  plot  of  percent  cost  deviation  versus  year 
signed  reveals  a  wide  dispersal  in  cost  deviations  with  no 
clear  visual  trend  apparent  (figure  6.12). 

A  least  squares  iir.ear  model  was  selected  in  order 
to  determine  if  for  all  manufacturers  a  trend  exists 
relating  cost  deviations  to  the  year  m  which  contracts  were 
signed.  The  results  of  this  indicates  that  in  fact  an 
upward  trend  in  cost  deviations  did  occur  from  1949  through 
1964  (figure  6.12).  Tne  compute!  t-value  of  3.3  is  quite 
significant  and  indicates  tnat  the  £.ro  bar  ili  t  y  that  the 
value  of  the  coeficient  3(1)  was  actually  zero  is  much  less 
tran  .05.  The  slope  of  the  regression  line  is  .560  with  tne 
lower  and  upper  confidence  intervals  .  232  and  .  927  respec¬ 
tively.  This  also  clearly  supports  tne  upward  trend  of  cost 
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Figure  6. 12  Contract  Performance  Over  Time. 

While  an  overall  increasing  trend  in  cost  deviations 
is  evident,  the  performance  of  the  individual  manufactures 
involved  might  reasonably  be  expected  to  differ.  The  gener¬ 
ation  of  Draftsman  displays  for  each  manufacturer  would 
provide  the  starting  point  for  comparison.  While  the  entire 
displays  are  not  presented,  the  scatter  plots  of  two  major 
contractors  (Grumman  and  Lockeed),  indicate  how  different 
performance  results  relate  in  the  general  cost  deviation 
pict  ure. 

In  applying  the  linear  regression  model  to  Grumman 
as  seen  in  figure  6.13,  cost  deviations  rose  rapidly  over 
tne  time  period.  This  rise  is  aucn  faster  than  that  seen  in 
figure  6.12  for  all  the  firms  in  general.  From  a  government 
perspective  this  might  suggest  that  a  closer  scrutiny  of 
this  company's  activities  might  be  warranted. 

The  application  of  the  regression  model  to  Lockeed 
as  seen  in  figure  6.14  presents  a  very  dirferent  picture. 
In  this  instance  a  cubic  fit  ratner  taan  a  straignt  line  is 
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Figure  6.14  Lockeed  Contract  Performance. 
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XPDATA[ ((TCtC)] 

"R  (  (  TR  +  R  )  =  (  TC  +  C  )  )/SKIP 

PQSNpPi+((0.18  ”0.18  0.18  '0.18)»((C-1>, 
5  3 
9  3 

-»(  <c=i )x<  (R  =  5)v<  < tr+r ) =ncol >  >  (/graph 
XAXISp • > 

-»  (  C=  1  )  /GRAPH 
XAXtSHH[(TCtC) ; 3 
VOX  I  S«-  •  • 

-*<  <R  =  5)v-<  (TR  +  R)=NCOL)  )  /GRAPH 


GRAPH  ;  M  X  HMAX 
RUN  Basic 

S«IP;s(  <  (TRtR)l(NCOL)  )/v<  (TC+C)  IHCOL  )  )/EHD 
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DAT  A  4-Q 
-►LB 
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DAT  A+.CMSRE  AD 

LD; 'DO  TOU  DESIRE  ALL  OF  THIS  DATA* 

•TO  BE  PRESENTED  IN  THE  DRAFTSMAN' 
'DISPLAT  OR  JUST  A  5UBSAMPLE  OF  IT? 1 
■ EMTER  (ALL  OR  SUB)* 

RZ2«-D 

->  (  R  1  all  1  )/LC 

SUB  DATA 

LC  ;  •  DO  TOU  HAVE  A  TWO  DIMENSIONAL  1 
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PATAfRPOSH/[l]  MATRIX 
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C  1  4  3 


[13 
C23 
C  3  3 
[43 
C  5  3 
C  6  3 
[73 
C  8  3 
[93 
[103 
[113 
C  12  3 
C  133 
C  1 4  3 
[153 
C1A3 
[17  3 
[183 
[193 
C  20  3 
[213 


C  1  3 
C  2  3 
C  33 
C  4  3 
C53 
C  6  3 
C73 
C  8  3 
C  9  3 
C  103 
C  1  13 
C  12  J 
C  133 
C  1  4  3 
t  153 
C163 
t  173 
C  18  3 


<3  L06EL5  DOTAfIDXJI 

r,OTO) 
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•THE  COLUMN  LABELS  ARE  A  GLOBAL1 
•VARIABLE  CALLED  NAMES' 
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